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Abstract 

Background: Aluminum (Al) toxicity is an important limitation to food security in tropical and subtropical regions. 
High Al saturation on acid soils limits root development, reducing water and nutrient uptake. In addition to 
naturally occurring acid soils, agricultural practices may decrease soil pH, leading to yield losses due to Al toxicity. 
Elucidating the genetic and molecular mechanisms underlying maize Al tolerance is expected to accelerate the 
development of Al-tolerant cultivars. 

Results: Five genomic regions were significantly associated with Al tolerance, using 54,455 SNP markers in a 
recombinant inbred line population derived from Cateto AI237. Candidate genes co-localized with Al tolerance QTLs 
were further investigated. Near-isogenic lines (NILs) developed for ZmMATE2 were as Al-sensitive as the recurrent 
line, indicating that this candidate gene was not responsible for the Al tolerance QTL on chromosome 5, qALT5. 
However, ZmNratl, a maize homolog to OsNratl, which encodes an Al 3+ specific transporter previously implicated in 
rice Al tolerance, was mapped at -40 Mbp from qALT5. We demonstrate for the first time that ZmNratl is preferentially 
expressed in maize root tips and is up-regulated by Al, similarly to OsNratl in rice, suggesting a role of this gene in 
maize Al tolerance. The strongest-effect QTL was mapped on chromosome 6 {qALT6), within a 0.5 Mbp region where 
three copies of the Al tolerance gene, ZmMATEl, were found in tandem configuration. qALT6 was shown to increase 
Al tolerance in maize; the qALT6-H\Ls carrying three copies of ZmMATEl exhibited a two-fold increase in Al tolerance, 
and higher expression of ZmMATEl compared to the Al sensitive recurrent parent. Interestingly, a new source of Al 
tolerance via ZmMATEl was identified in a Brazilian elite line that showed high expression of ZmMATEl but carries a 
single copy of ZmMATEl. 

Conclusions: High ZmMATEl expression, controlled either by three copies of the target gene or by an unknown 
molecular mechanism, is responsible for Al tolerance mediated by qALT6. As Al tolerant alleles at qALT6 are rare in 
maize, marker-assisted introgression of this QTL is an important strategy to improve maize adaptation to acid soils 
worldwide. 
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Background 

Aluminum (Al) toxicity is one of the major constraints 
for crop production on acidic soils, which comprise over 
50% of the worlds potentially arable lands [1]. Under 
low soil pH, Al is solubilized into highly rhizotoxic ionic 
forms, primarily Al 3+ , that inhibit root growth, reducing 
water and nutrient uptake [2]. Although liming and 
other agronomic practices can ameliorate soil acidity, 
such technologies are not readily available for a large 
number of small-scale farmers particularly in the devel- 
oping world, where agriculture is the main source of in- 
come and food. In addition to naturally occurring acid 
soils, soil acidification has been reported after 21 years 
of no-tillage crop system in the south of Brazil [3], an 
agricultural production method now largely adopted 
worldwide [4]. Maize is the most produced cereal in the 
world and is widely used in animal feed, human food 
and industrial purposes. In addition, maize is a major 
staple food in Africa and Latin America where acid soils 
are common [5]. Therefore, efforts to elucidate the gen- 
etic and molecular mechanisms underlying maize Al tol- 
erance are expected to accelerate the development of 
Al-tolerant cultivars. 

Al tolerance has been shown to be controlled by major 
genes in wheat [6], sorghum [7,8], and barley [9]. These 
plant species were used to clone the first Al tolerance 
genes, which were involved in organic acid exudation to 
exclude Al from the root apex. The Al tolerance gene in 
wheat encodes an aluminum- activated malate trans- 
porter (TaALMTl) responsible for malate release from 
root apices [6]. Subsequently, ALMT1 homologs with a 
role in Al tolerance were identified in Arabidopsis [10], 
rape [11], and rye [12]. Another gene family involved in 
Al tolerance is the multidrug and toxic compound extru- 
sion (MATE) family that was originally associated with 
Al-activated citrate release in sorghum (SbMATE) [8] and 
barley (aluminum-activated citrate transporter 1, HvAACTl) 
[9]. In addition, MATE homologs confer Al tolerance in 
Arabidopsis (AtMATE) [13], wheat (TaMATEl) [14], 
rye (ScFRDL2) [15], maize (ZmMATEl) [16], and rice 
(OsFRDL4) [17]. 

Al tolerance is quantitatively inherited in Arabidopsis 
[18], maize [19-21], and rice [22-26]. Evidences for in- 
ternal Al detoxification as a complementary Al tolerance 
mechanism have been supported by the identification of 
genes encoding ATP binding cassette transporters in Ara- 
bidopsis (ALS1 and ALS3) [27,28] and rice (STAR1, STAR2 
[29]; OsALSl [30]). Additionally, Nramp aluminum trans- 
porter 1 (Nratl) is an aluminum transporter localized on 
the plasma membrane, which is required in the initial 
steps of internal Al detoxification in rice [31]. The expres- 
sion of many genes involved in Al tolerance is regulated 
by a Cys2-His2 zinc finger transcription factor in Arabi- 
dopsis (STOP1) [32,33] and in rice (ART1) [34]. In fact, 



ART1, STAR2 and Nratl co-localized with Al tolerance 
QTLs in rice [26], indicating that QTL mapping is a useful 
tool to help in the validation of candidate Al tolerance 
genes. 

Al tolerance is a complex trait in maize [35-37], with 
possible involvement of multiple genes and physiological 
mechanisms. One such mechanism is root citrate release 
[38,39], although other mechanisms are likely to contrib- 
ute to maize Al tolerance [40]. However, it is noteworthy 
that only a few QTLs [19-21] and candidate genes 
[16,41,42] have been reported for Al tolerance on this 
crop. Combining association and linkage mapping, four 
candidate genes were associated with Al tolerance in 
maize, Zea mays A/^-like (ZmASL), ZmALMT2, malic 
enzyme and S-adenosyl-L-homocysteinase [41]. How- 
ever, functional characterization of ZmALMT2 did not 
support a role in maize Al tolerance [43]. Two maize ho- 
mologs of sorghum SbMATE, ZmMATEl and ZmMATE2, 
co-localized with Al tolerance QTLs [16] that were consist- 
ently detected in other QTL studies [19,20]. ZmMATE2 
was not conclusively implicated in Al tolerance, but 
ZmMATEl is up-regulated by Al and encodes a membrane 
transporter responsible for Al-activated citrate exud- 
ation in root apices of Al-tolerant maize lines [16]. Re- 
cently, three copies of ZmMATEl were associated with 
high expression of this gene, which was proposed to 
underlie a major Al tolerance QTL mapped on maize 
chromosome 6 [42]. 

Here we performed a marker-trait association study with 
unprecedented marker density generated by genotyping- 
by-sequencing (GBS) [44] to provide a detailed description 
of the genetic complexity underlying maize Al tolerance. 
Five QTLs highly associated with Al tolerance were identi- 
fied with 54,455 single nucleotide polymorphisms (SNPs). 
Based on our results, ZmNratl, which encodes a protein 
sharing high amino acid sequence identity with the rice 
Al 3+ transporter, Nratl, is a candidate gene for further Al 
tolerance studies in maize. We also validated a major Al 
tolerance QTL, qALT6, which was able to improve Al tol- 
erance in maize NILs. Finally, a novel Al tolerance source 
in maize based on ZmMATEl was identified, which does 
not rely on copy number variation to maintain high levels 
of ZmMATEl expression. 

Results 

Multiple genomic regions are associated with Al tolerance 
in maize 

Significant genotypic differences for relative net root 
growth (RNRG) were detected in the RIL population, 
which was used to assess Al tolerance. Net root growth 
in control conditions (NRG_c) was also assessed as a 
measurement of intrinsic differences in root growth that 
are not necessarily related to Al tolerance (Table 1). The 
heritability estimates based on family means were 0.94 
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Table 1 Analyses of variance for relative net root growth 
and for net root growth in nutrient solution without Al in 
the recombinant inbred lines population 



Source of variation DF 


MS 






RNRG 


NRG_c 


Genotypes 1 1 7 


562.80** 


864.00** 


Residual 118 


36.13 


67.30 


Total 235 






Coefficient of experimental variation (CVe%) 


10.31 


8.27 


Coefficient of genetic variation (CVg%) 


27.84 


20.12 


Heritability 


0.94 


0.92 


CVg/CVe 


2.70 


2.43 



Relative net root growth: RNRG; Net root growth in nutrient solution without 
Al: NRG_c. 

DF: degrees of freedom; MS: mean square; ^significant at p < 0.01. 



for RNRG and 0.92 for NRG_c, with a coefficient of 
variation close to 10% for both traits (Table 1). 

Of the 458,255 SNPs generated with GBS, 54,455 
SNPs were selected after imputing and filtering process 
(Additional file 1: Table SI). The imputation procedure 
improved by 43% the global number of SNPs, when 
compared with the selection of markers with less than 
20% of missing data without imputing. Additionally, for 
chromosome 10, reducing MAF from 0.4 to 0.3 in- 
creased the number of SNPs from 694 to 2,803, indicat- 
ing a significant segregation distortion of GBS-based 
markers on this chromosome. 

The QTL mapping approach detected five genomic re- 
gions significantly associated with Al tolerance on maize 
chromosomes 2, 3, 5, 6, and 8. Together, these QTLs ex- 
plained approximately 63% of the total phenotypic variance 
for RNRG (Table 2 and Figure 1). Epistatic interactions 
were not detected with the current population size. Three 
candidate genes for Al tolerance previously described in 
maize, ZrnMATEl, ZmMATE2 [16] and ZmASL [41] were 
genetically mapped to their predicted physical position 
based on anchored flanking SNPs. Two major Al tolerance 



QTLs were found on chromosomes 3 (qALT3) and 6 
(qALT6), which individually explained -27 and 30% of 
the variation in RNRG, respectively (Table 2). All the 
alleles increasing Al tolerance were donated by the Al- 
tolerant parent. Context sequences and SNP markers 
flanking each QTL were provided for approximately 
200 bp windows defined based on the SNP markers 
with the highest -log 10 (P-value) for association with Al 
tolerance (Additional file 2: Table S2). 

Aluminum tolerance has been traditionally pheno- 
typed based on the degree of root growth inhibition in 
Al-treated plants, relative to a -Al control, to remove 
possible confounding effects caused by intrinsic differ- 
ences in root growth. To confirm that Al tolerance 
QTLs were not artifacts caused by variation in root 
growth unrelated to Al tolerance, QTLs were also 
mapped for NRG_c. Three genomic regions on chromo- 
somes 1, 3, and 6 were associated with NRG_c, explain- 
ing approximately 57% of the total phenotypic variance 
(Table 3). Only the root growth QTL on chromosome 3 
(qNRG_c3) was located near the Al tolerance QTL 
qALT3, but with opposite effects, indicating that for this 
QTL, selection to improve Al tolerance may cause a re- 
duction in root length. Nevertheless, the other Al toler- 
ance QTLs were not affected by root growth in the 
absence of Al. 

Candidate genes on qALT5 

qALTS explained 17.6% of the phenotypic variance for 
RNRG and spanned a genomic region from 18.70 to 
40.10 Mbp on chromosome 5. Based on the current an- 
notation of the B73 RefGen_v2 genome sequence 
(5b.60), 432 gene models are predicted in this region, of 
which 30 were covered by at least one SNP significantly 
associated with Al tolerance (Additional file 3: Table S3). 
Among these genes, we selected ZmMATE2 predicted 
at 20.60 Mbp, which was previously characterized as a 
membrane transporter and putatively associated with 
Al tolerance in maize [16]. Maize NILs on the Al 



Table 2 QTLs identified through a multi-SNP regression model fitted with relative net root growth (RNRG) data 



QTL 


Chr 


SNPJD 


Position (Mbp) 


-log 10 (P) 


Effect 


R P 2 (%) 


CI (Mbp) 


qALT2 


2 


S2_2 129405 14 


212.94 


4.83 


9.14 


15.47 


208.20 - 215.41 


qALT3 


3 


S3_1 87460236 


187.46 


8.68 


13.11 


27.51 


186.73 - 188.90 


qALTS 


5 


S5_30301926 


30.30 


5.46 


9.82 


17.56 


18.70 - 40.10 


qALT6 


6 


ZrnMATEl 


5.86 


9.74 


14.21 


30.54 


5.44 - 5.96 


qALT8 


8 


S8_22681622 


22.68 


6.17 


10.43 


19.86 


17.06 - 27.64 


R T 2 (%) 












62.78 





Chr: maize chromosome; SNPJD indicates the chromosome followed by the physical position in bp; ZrnMATEl is the sequence-tagged site marker developed 
within ZrnMATEl. 

The position of each QTL was determined in Mbp (Mega base pairs) as the highest -log 10 (P-value). 

R P 2 {%) is the percentage of the phenotypic variance explained by each QTL and R T 2 {%) is the percentage of the phenotypic variance explained by the full model, 
including all significant QTLs. 

CI is the 95% confidence interval of the QTL position. 
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Figure 1 Al tolerance QTLs detected using a multiple regression model. QTLs were assigned as significant at p < 0.001. 



sensitive background belonging to L53 introgressed with 
ZmMATE2 were as Al-sensitive as was L53 (Figure 2a), 
and showed similar ZmMATE2 expression levels to the 
parents (Figure 2b). ZmMATE2 was not differentially 
expressed between the parental lines (Figure 2b), as re- 
ported by Maron et al [16]. However, when the expres- 
sion pattern of ZmMATE2 was investigated in the RILs, 
one expression QTL (eQTL) explaining 19.9% of the vari- 
ance for ZmMATE2 expression was revealed at 204.66 
Mbp on chromosome 3 (Additional file 4: Figure Sla), a 
position different from the target gene on chromosome 5. 

Interestingly, another peak was detected close to 
qALT5, at 69.30 Mbp, but it was not significant in the 
multi-SNP regression model (Figure 1). In the vicinity of 
this putative peak, we identified the predicted gene 
GRMZM2G168747 at 74.61 Mbp, sharing 83% of amino 
acid sequence identity to rice Nratl. The tissue specifi- 
city of ZmNratl was then evaluated after 6 hours of Al 



exposure in both parental lines (Figure 3a). The results 
demonstrate that ZmNratl is highly induced by Al in 
root tips, moderately expressed in basal root segments 
(1 to 3 cm above root tip) and is not expressed in 
shoots. A detailed time course response to Al treatment 
of root tips revealed that ZmNratl was up-regulated by 
Al at very early stages in the tolerant line, reaching the 
maximum level after 1 hour of Al treatment, whereas in 
the Al-sensitive line the highest expression was achieved 
3 hours after exposure to Al (Figure 3b). 

Candidate genes on qALT6 

A highly significant QTL region was detected from 5.44 
to 5.96 Mbp (qALT6), explaining a large proportion of 
the variation in Al tolerance. This genomic region 
encompassed ZmMATEl although none of the 54,455 
SNPs tagged this candidate gene. Instead, a sequence- 
tagged site (STS) marker generated within ZmMATEl 



Table 3 QTLs identified through a multi-SNP regression model fitted with net root growth in nutrient solution without 
aluminum (NRG_c) data 



QTL 


Chr 


SNPJD 


Position (Mbp) 


-log 10 (P) 


Effect 


R P 2 (%) 


CI (Mbp) 


qNRG_d 


1 


S1_69700844 


69.70 


7.98 


-15.97 


25.06 


67.25 - 71.15 


qNRG_c3 


3 


S3_1 95644732 


195.64 


10.50 


-18.97 


32.18 


187.71 - 195.96 


qNRG_c6 


6 


S6_1 58261 558 


158.26 


10.20 


-18.46 


31.35 


156.99 - 158.45 


R T 2 (%) 












56.88 





Chr: maize chromosome; SNPJD indicate the chromosome and the physical position in bp. 

The position of each QTL was determined in Mbp (Mega base pairs) as the highest -log 10 (P-value). 

R P 2 (%) is the percentage of the phenotypic variance explained by each QTL and R T 2 (%) is the percentage of the phenotypic variance explained by the full model, 
including all significant QTLs. 

CI is the 95% confidence interval of the QTL position. 
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Figure 2 Near-isogenic lines introgressed with ZmMATE2, using Cateto AI237 and L53 as the donor and recurrent parents, respectively. 

A) Aluminum tolerance measured as relative seminal root growth after five days under {39} uM Al 3+ . B) ZmMATE2 relative expression evaluated in 
root tips after 6 hours of treatment with {39} uM Al 3+ . Expression of L53 was used as calibrator. Error bars indicate standard deviation. 
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Figure 3 Spatial and temporal expression profiles of ZmNratl in two contrasting maize lines for Al tolerance. A) Expression of ZmN 'rati 
in different tissues under controlled conditions, and after 6 hours at {39} uM Al 3+ . B) Time course of ZmN 'rot! expression following treatment of 
root tips with {39} uM Al 3+ . Expression of L53 under controlled condition (-AI) was used as calibrator. Error bars indicate standard deviation. 
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had the highest -logCP-value) for RNRG (Table 2). A single 
eQTL, explaining 70.9% of the variance for ZmMATEl ex- 
pression, overlapped the qALT6 region where ZmMATEl 
lies on chromosome 6 (Additional file 4: Figure Sib). This 
confirms previous results obtained at much lower SNP 
density, i.e. 1,894 SNPs on chromosome 6 [42]. 

The Cateto A1237 introgression region in the qALT6- 
NILs was estimated as 10 Mbp, extending from positions 
3.4 to 13.4 Mbp on chromosome 6. This region includes 
the STS marker for ZmMATEl at position 5.86 Mbp, 
which showed the highest -log(P-value) with RNRG 
within qALT6. The qALT6-NIL harboring the Cateto 
A1237 allele showed a two-fold increase in Al tolerance 
when compared to the L53 counterpart (Figure 4a). 
The former exhibited also high ZmMATEl expression 
(Figure 4b) and three copies of ZmMATEl, similarly to 
the donor parent Cateto A1237 (Figure 4c). 

Phylogeny of MATE members in maize 

Forty-three predicted MATE members found in the 
maize genome were clustered in two major groups, one 



of them comprising citrate transporters associated with Al 
tolerance in different plant species that included five 
maize MATEs (Additional file 5: Figure S2). These five 
predicted maize MATE proteins shared between 20 and 
69% sequence identity to SbMATE, including ZmMATEl 
and ZmASL, which was previously associated with Al 
tolerance [41] (Additional file 6: Table S4). Of the five can- 
didate MATE members analyzed, only ZmMATEl was 
co-localized with Al tolerance QTLs mapped in this study. 

Identification of high ZmMATEl expression in Brazilian 
maize lines 

High ZmMATEl expression is associated with superior 
Al tolerance in maize RILs and NILs harboring the 
Cateto A1237 allele at qALT6. Thus, ZmMATEl expres- 
sion pattern under Al stress was investigated in 36 maize 
lines derived from the Embrapa breeding program in- 
cluding the intermediately Al-tolerant, temperate maize 
line B73. The Embrapa elite lines showed a wide range 
of Al tolerance based on RNRG at {39} \iM Al 3+ , ranging 
from Al-sensitive (RNRG < 30%) to highly Al-tolerant 
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Figure 4 Near-isogenic lines introgressed with qALT6, using Cateto AI237 and L53 as the donor and recurrent parents, respectively. 

A) Aluminum tolerance measured as relative seminal root growth after five days under {39} uM Al 3+ . B) ZmMATEl relative expression evaluated in 
root tips after 6 hours of treatment with {39} uM Al 3+ (dark gray) and under control condition (light gray). Expression of L53 under controlled 
condition (-AI) was used as calibrator. C) ZmMATEl copy-number estimated based on qPCR. Error bars indicate standard deviation. 
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(RNRG>80%) (Figure 5). The broad sense heritability 
estimate for Al tolerance was 0.95, with an accuracy of 
0.97. In addition to the highly tolerant Cateto A1237, 
seven lines in the panel showed high ZmMATEl expres- 
sion (Figure 5). These were L228-3 as well as six elite 
lines derived from L228-3, all of which are intermedi- 
ately Al-tolerant. Interestingly, L228-3 and its derived 
elite lines did not carry the three copies of ZmMATEl 
reported for Cateto A1237 (Figure 6). The presence of a 
single copy of ZmMATEl in these maize lines was indi- 
cated by two qPCR assays, CNV2 and CNV4, amplifying 
a fragment of the second exon and the 3'UTR region of 
ZmMATEl, respectively [42]. Thus, high expression of 
ZmMATEl in these lines is likely mediated by a mech- 
anism other than copy-number variation as proposed for 
Cateto A1237 [42]. 

Discussion 

The genetic complexity of Al tolerance in maize was dis- 
sected using a large number of markers provided by next- 
generation sequencing in a RIL population derived from 
two highly contrasting maize lines. For QTL mapping, a 
procedure similar to multiple interval mapping was imple- 
mented, based on a multi-SNP regression model, which 
accommodated multiple QTLs simultaneously. 

Five genomic regions were found to be highly associ- 
ated with Al tolerance as measured by RNRG, including 



the novel QTL, qALT3. We also mapped a large-effect 
QTL for root growth under control (-A1) condition that 
was relatively close to qALT3 (qNRG_c3, at 195.64 Mbp 
on chromosome 3, Table 3). QTLs associated with root 
length were also previously mapped to a region sur- 
rounding qALT3 (171-179 Mbp on chromosome 3) 
[45,46]. Moreover, this region was flanked by two con- 
sensus QTLs, Rt-7 and Rt-8, which have been implicated 
in the control of several root traits in different maize 
populations [47]. Thus, although qALT3 does not over- 
lap with qNRG_c3, further use of this Al-tolerance QTL 
in maize breeding programs may lead to the introgres- 
sion of undesirable alleles for root length due to linkage 
drag. 

The other four Al tolerance QTLs coincided with re- 
gions previously reported to be related to maize Al toler- 
ance. The RFLP markers umcl22 and umc49, located at 
209.5 and 222.5 Mbp on chromosome 2, respectively, 
were associated with Al tolerance in five different maize 
crosses [48]. Also, an Al tolerance QTL was mapped in 
a genomic region equivalent to 198.8 - 211.2 Mbp on 
maize chromosome 2 [20]. Despite the low saturation of 
markers used in these studies, it is plausible that these 
regions could be overlapping to qALT2 (211.2 - 215.4 
Mbp). Similarly, the Al tolerance QTL identified on 
chromosome 8 by Ninamango- Cardenas et al [20] is 
likely to correspond to qALT8. 
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Figure 5 Aluminum tolerance and ZmMATEl relative expression among 37 maize lines. Aluminum tolerance measured as relative seminal root 
growth after five days under {39} uM Al 3+ and ZmMATEl relative expression evaluated in root tips after 6 hours of treatment with {39} uM Al 3+ . 
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Figure 6 ZmMATEl copy-number estimate based on quantitative PCR in Cateto AI237, L53, L228-3, L228-3-derived lines (line designations 
starting with numbers), and B73.The primers for CNV2 and CNV4 amplify a portion of exon 2 and 3'UTR region of ZmMATEl, respectively. 



The QTL, qALTS, reported here, was coincident with 
an Al tolerance QTL mapped on maize chromosome 5, 
which co-localized with ZmMATE2 [16]. This candidate 
gene was previously characterized as a putative anion 
transporter that was however unable to mediate citrate 
efflux [16]. This is consistent with the phylogenetic ana- 
lysis where ZmMATE2 was not included in the major 
group composed of all MATEs functionally identified as 
citrate transporters in other plants (Additional file 5: 
Figure S2). To test the hypothesis raised by Maron et al 
[16] that the Al-sensitive line, L53, would carry positive 
regulatory factors for ZmMATE2 expression, we mapped 
a trans-eQTL controlling part of ZmMATE2 expression 
on chromosome 3. Our data confirmed that maize NILs 
with the ZmMATE2 allele from Cateto A1237 in the gen- 
etic background of L53 showed the same ZmMATE2 ex- 
pression level as the parental lines. Additionally, these 
maize NILs were as Al-sensitive as the sensitive recur- 
rent line, L53. Thus, ZmMATE2 was constitutively 
expressed, independently of the source of transacting 
factors, either L53 or Cateto A1237. As ZmMATE2 
amino acid sequences from Cateto A1237 and L53 were 
identical [16], we can infer that ZmMATE2 is not the 
candidate gene controlling Al tolerance on qALTS, at 
least in our maize germplasm. 

Outside of the qALTS confidence interval but near a 
putative QTL peak, we identified a maize homologue to 
rice Nratl, named here as ZmNratl. OsNratl encodes 
an Nramp aluminum transporter involved in intracellu- 
lar Al 3+ uptake in rice [31]. In our study, ZmNratl was 
up-regulated by Al primarily in root tips, following an 
expression pattern similar to that observed for the rice 
OsNratl [31]. However, the main difference found be- 
tween the contrasting maize lines for Al tolerance was 
the early Al-induction of ZmNratl in the Al-tolerant 
Cateto A1237. A previous report based on microarray 



analysis indicated that ZmNratl was also induced by Al 
in root tips of the Al-tolerant Cateto 100-6 after 2, 6 
and 24 hours of Al treatment and only after 6 and 
24 hours in the Al-sensitive line (L53) [49]. Thus, these 
studies consistently showed that ZmNratl is rapidly in- 
duced by Al in tolerant maize lines (Cateto 100-6 and 
Cateto A1237), whereas induction in the Al-sensitive 
line, L53, is delayed. Functional evidence in rice knock- 
out mutants supported a role for Nratl in Al tolerance 
[31]. In addition, genomic regions surrounding this can- 
didate gene on rice chromosome 2 were also shown to 
contribute to natural variation for Al tolerance in a bi- 
parental QTL mapping [23] and in the aus subpopulation 
in a genome-wide association study [26]. Nevertheless, the 
Nratl QTL region explained a low percentage of Al toler- 
ance in a backcross introgression line population derived 
from temperate japonica x aus [23] and no QTL in this 
region was reported in a cross between tropical japonica 
and indica [26], suggesting that the effect of this QTL 
is dependent on the genetic background. The non- 
significance of the QTL peak spanning the ZmNratl 
region may be explained by: i) the presence of a signifi- 
cant QTL (qALTS) close to this candidate gene that 
may have retained the genetic variance of this sur- 
rounding region; and ii) the reduced population size, 
which limits the power to resolve two closely linked 
QTLs. However, the evidence presented here warrants 
further investigation on the involvement of ZmNratl 
on the internal Al detoxification mechanism in maize. 

The major Al tolerance QTL on chromosome 6 (qALT6) 
was also identified in other QTL studies [16,19,20]. 
Within this QTL region, we mapped ZmMATEl as well 
as a major eQTL controlling its expression. The overex- 
pression of ZmMATEl was shown previously to improve 
Al tolerance in transgenic Arabidopsis [16]. Here we dem- 
onstrate that the high expression of ZmMATEl almost 
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doubled the Al tolerance of maize NILs when qALT6 was 
introgressed into an Al-sensitive line. These NILs har- 
bored three copies of ZmMATEl, confirming that the 
functional allele derived from Cateto A1237 was trans- 
ferred. Interestingly, the expression pattern of ZmMATEl 
in maize NILs was quite different from SbMATE in sor- 
ghum NILs, which was down-regulated when different 
alleles were introgressed into a common, Al-sensitive 
background [50]. This reduction in SbMATE expression 
was closely correlated with a decrease in Al tolerance, sug- 
gesting an incomplete transfer of regulatory factors acting 
in trans on SbMATE [50]. Marker-assisted introgression 
of SbMATE would be dependent on the genetic back- 
ground and may require additional genomic regions carry- 
ing accessory factors necessary for the full expression of 
SbMATE. In maize, introgression of qALT6 seems to be 
enough for reaching full expression of ZmMATEl. Thus, 
our results, together with those presented in previous re- 
ports [16,42], strongly support that ZmMATEl is the gene 
underlying qALT6, which enhances Al tolerance through 
its up-regulation under Al stress. 

Cateto is a group of maize landraces from South 
America that were extensively adopted by native Indians 
and were also used as commercial varieties or in local 
hybrid programs [51]. Cateto lines were identified as 
highly Al-tolerant under field conditions [52] and in nu- 
trient solution [53,54] in the early 1980s. Since then, 
Cateto lines have been considered as major sources of 
Al tolerance in maize. Three Cateto lines have been the 
subject of detailed genetic, physiological, and molecular 
studies of Al tolerance: Cateto A1237 [16,20,42,49], 
Catl00-6 [19,42,49,55,56], and Cateto Colombia [39,40]. 
The high Al tolerance of these lines in nutrient solution 
was mainly associated with an Al-exclusion mechanism 
mediated by Al-activated citrate exudation in root tips 
[16,39,40,42]. Among those, only the native Brazilian 
lines, Cateto A1237 and Catl00-6, were confirmed to 
show high ZmMATEl expression controlled by three- 
copies of this gene, which is considered to be the major 
molecular mechanism underlying citrate exudation in 
those lines [42]. As Cateto landraces were widely culti- 
vated by natives and early farmers in South America 
[51], it is reasonable to expect that their extreme Al tol- 
erance accounts for their superior adaptation to acid 
soils. However, these landraces carry undesirable traits, 
reducing their potential for direct use in commercial 
maize breeding programs [57]. Thus, the introgression 
of Al tolerance alleles from native races into maize elite 
lines is an important target to improve maize stability in 
tropical soils. 

A further characterization of tropical maize lines re- 
vealed that L228-3 and its derived elite lines also pre- 
sented high ZmMATEl expression. However, in contrast 
to Cateto A1237 and Catl00-6 lines that carry this gene 



in triplicate [42], only a single copy of ZmMATEl was 
found in L228-3 and its derived lines. L228-3 is the main 
founder of the dent heterotic group from the Embrapa 
maize breeding program [58], which was derived from 
the open pollinated variety BR106, as a result of 13 se- 
lection cycles in the Tuxpeno Composite [59]. Elite dent 
lines extracted from BR 106 were used to develop com- 
mercial hybrids widely cultivated in Brazilian Cerrado, 
including BR201, a double cross hybrid that occupied up 
to 14% of the Brazilian seed market from 1989 to 1993 
[60]. We can infer that, in L228-3, a different molecular 
mechanism to copy number variation (CNV) would en- 
hance ZmMATEl expression under Al stress, conferring 
an important advantage to acid soils adaptation. 

Conclusions 

The high ZmMATEl expression controlled either by 
CNV or by another molecular mechanism that underlies 
qALT6, in addition to significantly improving Al toler- 
ance in maize, is rare and restricted to a few South 
American landraces, mainly from Brazil and Colombia. 
Thus, we may expect that a large proportion of modern 
maize lines that range from Al sensitive to intermediate 
in Al tolerance could be improved for Al tolerance via 
marker-assisted introgression of this genomic region. 

Methods 

Genetic stocks 

A population of 118 recombinant inbred lines (RILs) in 
S7, derived from a cross between Cateto A1237 and L53, 
was used for QTL mapping. Cateto A1237 and L53 have 
been characterized as highly Al tolerant and Al-sensitive, 
respectively. 

Assessment of aluminum tolerance in nutrient solution 

Experiments were performed in a growth chamber at 
27/20°C day/night temperatures, with light intensity of 
330 umol photons m -2 s _1 and a 12-h photoperiod. 
Maize seeds were surface-sterilized with 0.5% (w/v) 
NaOCl for 5 minutes, thoroughly rinsed in deionized 
water and germinated for four days on moistened ger- 
mination paper rolls. Seedlings were transferred to poly- 
ethylene cups placed into containers filled with 8.5 1 of 
nutrient solution at pH 4.0, according to Magnavaca 
et al. [35], under continuous aeration, and acclimatized 
in full nutrient solution for 24 h. After the initial root 
length (IRL) was measured, +/- Al treatments were im- 
posed by replacing the nutrient solution with nutrient 
solution of identical composition but with or without 
{39} uM Al 3+ activity supplied as KA1(S0 4 ) 2 . Free Al 3+ 
activities (number in brackets) were calculated using the 
GEOCHEM-EZ speciation software [61]. The solution 
pH was adjusted to pH 4.0. After five days, the final root 
length (FRL) of each plant was measured. Net root 
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growth (NRG) was calculated as NRG = FRL - IRL, ei- 
ther under Al treatment (NRG_A1) or under control 
condition, without Al (NRG_c). Finally, Al tolerance was 
expressed as Relative Net Root Growth (RNRG) esti- 
mated as RNRG = NRG_Al/NRG_c. 

Experiments were performed in a complete random- 
ized design with two replicates and seven plants per plot. 
Analysis of variance was performed using the Genes 
software [62]. 

Molecular markers 

Genomic DNA was isolated from leaf tissue using the 
CTAB protocol [63]. Sequence-tagged site (STS) markers 
for candidate genes were designed based on polymorphisms 
detected in the parents. ZmMATEl was genotyped with a 
129-bp indel amplified with the primers 5-CCGGATGTT 
TGCTGGATTTT-3 ' and 5 -TGGCCAAATCGACCATG 
ATT-3', and resolved on 1% (w/v) agarose gel stained with 
ethidium bromide. ZmMATE2 was mapped with a 10-bp 
indel detected with the fluorescently labeled primers, 5'- 
GCAGTTCGTACGTAGTGGTG-3 ' and 5 -AGTACGTA 
GCTAGGCGATGC-3 ', in the ABI3100 genetic analyzer 
(Applied Biosystems). For ZrnASL, the primers 5'-CCGG 
CACAGCAGTATCAAC-3 ' and 5 -TTGCTTTCCCCGA 
TAGAGAA-3' were designed based on reference se- 
quences obtained from the Maize Assembled Genomic 
Island (MAGI) Database [64], which detected a 25-bp indel 
in the parents. ZmNratl [MaizeGDB: GRMZM2G069198] 
was genotyped using a Cleaved Amplified Polymorphic 
Sequence (CAPs) marker based on a genomic fragment 
amplified with primers 5 -CGCGGAAACAGGAACCAA 
ACCAAAA-3' and 5 -CGGGTCTCTGCGTACCCCGA-3 ', 
cleaved with Hinfi. The STS and CAPs markers described 
above were genotyped on 1% (w/v) agarose gels. 

SNPs were genotyped using genotyping-by-sequencing 
as described in Elshire et al [44]. SNPs were designated 
with the letter S followed by the chromosome number and 
the physical position along each chromosome. Missing data 
imputation was performed with the software Npute [65]. 
SNP filtering was performed using the TASSEL software 
version 3.0.83 [66] with a minimum minor allele frequency 
(MAF) threshold of 0.4, except for chromosome 10 that 
was filtered using a minimum MAF of 0.3, due to a high 
percentage of SNP markers showing segregation distortion. 

QTL mapping 

The procedure used for QTL mapping was similar to 
multiple interval mapping [67], which allows fitting mul- 
tiple QTL simultaneously with main effects and epistatic 
interactions. The high marker density provided by GBS 
precluded the need for estimating conditional probabil- 
ities of marker genotypes. 

First, a single marker analysis was performed using a 
linear regression model. Markers associated with Al 



tolerance were declared considering a p < 0.001 thresh- 
old. Next, a multiple regression model was fitted by add- 
ing the most significant SNP to the model selected in 
the preceding step. This was done in a stepwise fashion 
until no more significant SNPs were found. Finally, we 
tested all pairwise epistatic interactions among SNPs 
that were kept in the model as well as between those 
and all other SNPs. 

SNP effects, partial (Rp) and total (R%) coefficients of 
determination were estimated from a multi-SNP regres- 
sion model. R 2 T was calculated from the full model in- 
cluding all SNP effects, whereas Rp was estimated for 
each SNP as follows. For a given significant SNP, we esti- 
mated the sum of squares of residuals from the reduced 
model (SSErm) without the SNP effect and the sum of 
squares of residuals from the full model (SSEfm). Rp was 
estimated as l-(SSEfmlSSErm). All QTL mapping pro- 
cedure was implemented in the software R [68]. 

Information of SNP markers flanking each QTL were ob- 
tained from Panzea (http://www.panzea.org/db/searches/ 
webform/marker_search). Markers were selected in a win- 
dow size of approximately 200 bp to the SNP associated 
with Al tolerance at highest -log 10 (P-value). 

Development of NILs for the two major Al tolerance QTL 

RIL-150 and RIL-84 were selected based on the presence 
of the Cateto A1237 allele at the Al tolerance QTLs, 
qALT6 and qALT5, respectively. These RILs were crossed 
with the Al sensitive recurrent parent, L53, following 
two cycles of marker-assisted backcrosses and selfing 
until BC 2 F 3 progeny fixed for the Al tolerance QTL were 
obtained. For qALT6, foreground selection was based on 
a STS marker for ZmMATEl and the SSR marker, 
umcl018, whereas for qALTS we used only the STS 
marker for ZmMATE2. Background selection was per- 
formed using -30 simple sequence repeats (SSR) 
markers randomly distributed along the maize genome, 
which allowed for the selection of NILs with -98% of 
the L53 genome. NILs for qALT6 were genotyped-by- 
sequencing to evaluate the size of the Cateto A1237 
introgression. 

Expression pattern of ZmMATEl , ZmMATE2 and ZmNratl 

Temporal and spatial expression profiles of the candi- 
date genes were obtained using quantitative real-time 
PCR (qPCR), after 6 hours of Al treatment. For this ana- 
lyses, the first centimeter of root apices were sampled as 
previously described [16]. Expression of ZmNratl was 
also assessed in a root segment extending from 1 to 
3 cm from the root tip, which was designated as basal 
root segment, and in shoot tissues. The time-course pro- 
file for ZmNratl expression in root tips was obtained 
after 1, 3, 6 and 24 hours of exposure to {39} \iM Al 3+ , 
and compared to the control lacking Al. Total RNA was 
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extracted using the RNeasy Plant Mini Kit (Qiagen) ac- 
cording to the manufacturers instructions. First-strand 
cDNA was synthesized using the High Capacity cDNA 
Reverse Transcription Kit (Life Technologies). ZmMATEl 
expression was determined using a custom-designed Taq- 
Man assay consisting of forward primer 5'-CACCCGCT 
TAGCGTATTCCT-3 ', reverse primer 5-GCACCGCGA 
TCCTCATGAT-3 ' and probe 5 -TCTGAATGCGAGCC 
TCG-3', with a pre-designed TaqMan assay for Eukaryotic 
18S (Life Technologies) as endogenous control [42]. 
ZmMATE2 expression was evaluated using a TaqMan 
assay according to Maron et al [16]. The expression pro- 
file of ZmNratl was conducted using the Fast SYBR® 
Green Master Mix (Life Technologies) with primers F: 5'- 
CGCGCTTCTGATCCAAACA-3 ' and R: 5-GCGAGA 
TGCTTGCCTGTCTT-3 ', with 18S RNA as endogenous 
control. qPCR reactions for ZmMATE2 and ZmNratl 
were conducted in an ABI 7500 Fast (Life Technologies). 
Each biological sample was composed by 21 plants and 3 
technical replicates were adopted. Raw data were collected 
using the RQ Manager software (Life Technologies) and 
the relative expression levels were calculated using the 
ddCT method [69]. Relative expression data for ZmMATEl 
and ZmMATE2 were used for expression QTL (eQTL) 
mapping. 

Copy number quantification for ZmMATEl 

Copy number for ZmMATEl was estimated by qPCR 
using two primer pairs designed by Maron et al [42] 
with the Fast SYBR® Green Master Mix in an ABI 7500 
Fast (Life Technologies). The CNV2 assay was per- 
formed with primers F: 5-CCAGGCTCGCATTCA 
GATG-3'and R: 5 -GCACCGCGATCCTCATG-3 ', and 
CNV4 with F: 5 -TGTGAGTTTGGCGGATGTGT-3' 
and R: 5 -TCACAATCTAGGCCAGTACAACAGA-3 
both using the actin gene as a single copy control [42]. 
After optimization, the reactions were performed using 
6 ng of genomic DNA with 0.6 pmol and 0.3 pmol of 
primers for CNV2/CNV4 and the control, respectively. 
Three technical replicates were performed for each sam- 
ple and assay. Relative quantification (RQ) was calcu- 
lated by dividing the target sample quantity by the 
control sample quantity mean, then normalizing against 
the calibrator sample, L53, using the ddCT method [69]. 

Phylogenetic studies of MATE genes in the maize genome 

MATE proteins in the maize genome were identified by se- 
quence similarity analysis using the Sorghum bicolor 
SbMATE amino acid sequence [GenBank: ABS89149.1] as 
query in the B73 RefGen_v2 filtered set (Maize Genome 
Sequencing Project, release 5b.60, www.maizesequence. 
org). The Peptide Homologs tool in Phytozome was used 
to confirm that all predicted maize MATEs were identified. 



The amino acid sequences of maize MATEs as well as 
the MATE genes previously identified as Al tolerance 
genes in other crops, such as sorghum {SbMATE), Ara- 
bidopsis (AtMATE), rice (OsFRDL4), barley (HvAACTl), 
rye (ScFRDL2) and wheat (TaMATEl), were aligned 
using the Advanced M-COFFEE package available at T- 
COFFEE (www.tcoffee.org). The phylogenetic tree was 
constructed using maximum parsimony and 500 boot- 
straps with MEGA5 [70]. 

Al tolerance in a maize panel 

Thirty six tropical maize inbred lines developed by the 
Embrapa Maize and Sorghum (Sete Lagoas, Brazil) 
breeding program, as well as the temperate maize line 
B73, were evaluated for Al tolerance. Linear mixed 
model analysis was performed using an incomplete block 
design with common checks and three replicates, con- 
sidering genotypes and blocks as random effects and 
experiment as a fixed factor. The experimental unit con- 
sisted of seven plants per genotype. Two maize lines, 
L53 and Cateto A1237, were used as common checks in 
each experiment. The linear mixed model analysis was 
performed in the software SELEGEN-REML/BLUP [71] 
according to the model: y = Xt + Zg + Wb + e, where: y is 
the vector of phenotypes; t is the vector of experimental 
effects (fixed) plus the overall mean; g is the vector of 
genotypic effects (random); b is the vector of block ef- 
fects (random); and e is the error vector (random). X, Z 
and W represent the incidence matrices for the effects 
of t, g and b, respectively. The means and variances were 
structured and distributed as follows: 

y/t, V~N(Xt, V) 

g/a>~N(Q,I<$) 

b/ol~N(0,Iol) 
e/o 2 e ~N(0,lo 2 e ) 

The genotypic values (\i + g) were used to characterize 
the Al tolerance of these lines, and the confidence inter- 
vals (CI) were estimated using the standard error of the 
predicted genotypic values (SEP), which corresponded to 
the squared root of predicted error variance (PEV), ac- 
cording to the expression: IC = (\i + g) ± t x SEP, where: t 
(0.95) = 1-96 is the value of Student distribution at 95% of 
confidence interval. Non-overlapping confidence inter- 
vals indicate statistical differences of predicted genotypic 
effects in multiple comparisons. 

Additional files 



Additional file 1: Table SI. Genotyping-by-sequencing SNPs generated 
across the 10 maize chromosomes in the recombinant inbred line 
population. 
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Additional file 2: Table S2. SNP markers flanking each Al tolerance 
QTL in a windows size of approximately 200 bp to the SNP associated 
with Al tolerance at highest P-value. 

Additional file 3: Table S3. Predicted genes from 18.70 to 40.10 Mbp 
on chromosome 5, comprising the qALT5. 

Additional file 4: Figure SI. Expression QTLs (eQTLs) of ZmMATE2 and 
ZmMATEl on chromosomes 3 and 6, respectively. 

Additional file 5: Figure S2. Maximum likelihood phylogenetic tree of 
maize MATE members and MATE proteins characterized as citrate 
transporters in other plant species. Numbers in the nods indicate 
bootstrap values calculated using 500 resampling. 

Additional file 6: Table S4. Predicted maize MATE members clustered 
with citrate transporter from other plants, aminoacid sequence identity to 
SbMATE and the predicted physical position on the maize genome. 
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